Rank | Count | Beginning |
---|---|---|
8033 | 508 | У |
4407 | 247 | На |
38 | 241 | А |
2453 | 234 | За |
3230 | 189 | І |
229 | 169 | Але |
9628 | 157 | Як |
8917 | 153 | Це |
6386 | 139 | Про |
862 | 131 | В |
9577 | 101 | Я |
2105 | 100 | До |
1193 | 89 | Він |
2441 | 86 | З |
9819 | 84 | Якщо |
7436 | 80 | Також |
4137 | 71 | Ми |
2033 | 62 | Для |
3850 | 62 | Крім |
7849 | 62 | Тому |
4453 | 60 | Нагадаємо, |
5841 | 59 | Після |
1503 | 58 | Вони |
1442 | 57 | Вона |
6274 | 52 | При |
5006 | 50 | Не |
5290 | 49 | Однак |
3712 | 42 | Коли |
129 | 41 | Адже |
5767 | 41 | Під |
In the next four subsections show the most frequent sentence beginnings consisting of N words, N=1, 2, 3, 4. In this subsection we start with N=1.
The most frequent word-N-grams at the beginning of sentences give some insight into sentence composition.
Especially for N=1, we only need a small corpus to identify the most frequent sentence beginnings.
select substring_index(sentence, ' ', 1) as beg, count(*) as cnt from sentences group by substring_index(sentence, ' ', 1) order by cnt desc limit 50;
4.3.1.2 Most Frequent Sentence Beginnings II
4.3.1.3 Most Frequent Sentence Beginnings III
4.3.1.4 Most Frequent Sentence Beginnings IV
4.3.1.1 Most Frequent Sentence Endings I
4.3.1.2 Most Frequent Sentence Endings II
4.3.1.3 Most Frequent Sentence Endings III
4.3.1.4 Most Frequent Sentence Endings IV